TAO: Facebook's Distributed Data Store for the Social Graph
نویسندگان
چکیده
We introduce a simple data model and API tailored for serving the social graph, and TAO, an implementation of this model. TAO is a geographically distributed data store that provides efficient and timely access to the social graph for Facebook’s demanding workload using a fixed set of queries. It is deployed at Facebook, replacing memcache for many data types that fit its model. The system runs on thousands of machines, is widely distributed, and provides access to many petabytes of data. TAO can process a billion reads and millions of writes each second.
منابع مشابه
Scalable graph analytics with GRADOOP
Many Big Data applications in business and science require the management and analysis of huge amounts of graph data. Previous approaches for graph analytics such as graph databases and parallel graph processing systems (e.g., Pregel) either lack sufficient scalability or flexibility and expressiveness. We are therefore developing a new end-to-end approach for graph data management and analysis...
متن کاملHermes: Dynamic Partitioning for Distributed Social Network Graph Databases
Social networks are large graphs that require multiple graph database servers to store and manage them. Each database server hosts a graph partition with the objectives of balancing server loads, reducing remote traversals (edge-cuts), and adapting the partitioning to changes in the structure of the graph in the face of changing workloads. To achieve these objectives, a dynamic repartitioning a...
متن کاملAn Effective Method for Utility Preserving Social Network Graph Anonymization Based on Mathematical Modeling
In recent years, privacy concerns about social network graph data publishing has increased due to the widespread use of such data for research purposes. This paper addresses the problem of identity disclosure risk of a node assuming that the adversary identifies one of its immediate neighbors in the published data. The related anonymity level of a graph is formulated and a mathematical model is...
متن کاملGRADOOP: Scalable Graph Data Management and Analytics with Hadoop
Many Big Data applications in business and science require the management and analysis of huge amounts of graph data. Previous approaches for graph analytics such as graph databases and parallel graph processing systems (e.g., Pregel) either lack sufficient scalability or flexibility and expres-siveness. We are therefore developing a new end-to-end approach for graph data management and analysi...
متن کاملThe Effect of Information Rate and Store Environment on Purchasing Value; Analysis of the Role of Confusion and Motivational Tendency
Objective A great number of customers spend much more time than expected on shopping because of a lot of reasons like variety in products. Thus, they may feel confused and disappointed. Such confusion can influence purchasing procedure and determine purchasing behavior. Such customers fail to purchase wisely and may face difficulty choosing appropriate and reasonable products. However, if the ...
متن کامل